1 N78-30469^ ,s(3 ^ 

APPLICATIONS AND LIMITATIONS OF VERY LARGE-SCALE 
INTEGRATION IN SAR AZIMUTH PROCESSING 

DENNIS G. KUHLER 

GOODYEAR AEROSPACE CORPORATION 
P.O. BOX 85 

LITCHFIELD PARK, ARIZONA 85340 
SUMMARY 

By using digital CCD's for data storage and single- chip multiplier accumu- 
lators, a low-power, high- density convolution processor may be designed. 

The memory tradeoffs involving chip design and speed are related to the 
operation of the processor and several optimum memory organizations are 
given. The major limitation of a convolution processor designed with CCD 
memory chips is the inability to operate in real time except for slow aircraft 
speeds or coarse resolutions. Two methods of summing the products are 
evaluated with respect to speed, power, and space requirements. A con- 
volution processor is designed, and the number of chips as well as the power 
and volume requirements are determined using 4-, 6-, and 8-bit data words. 

The processor is flexible because range samples may be traded for addi- 
tional azimuth samples by altering the control signals. The processor is 
also modular, and additional range or azimuth samples may be processed 
by adding more cards. Additional azimuth looks may be obtained by 
duplicating the single- look processor. 

1.0 INTRODUCTION 

The largest amount of hardware and the highest power requirement in a SAR 
data processor have been used to process the data in the azimuth or along- 
track dimension. The prefiltered azimuth data are stored in a corner- 
turning memory for the length of a synthetic aperture and convolved with the 
azimuth reference function to obtain processed azimuth samples. All range 
samples for a given azimuth position are processed before processing the 
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next group of range samples. The convolution operation performed on N 
azimuth samples may be expressed by the equation; 

n=(N+l)/2 

S Q (mT) = V hfoTjS^mT - nT) , (1) 

n=(N^l)/2 

where 

S 0 (">T) = complex output samples 

N = samples in synthetic aperture 
h(nT) = complex reference 
Sj(mT - nT) = complex input samples. 

From eq. (1), the number of complex multiplications required per output 
sample is N, and the number of summations is N - 1. The in-phase and 
quadrature samples representing digitized radar returns collected for a 
synthetic aperture are stored in the CCD memories, multiplied by the com- 
plex reference function, and accumulated to give the azimuth-compressed 
signal (see Fig. 1). Convolution processing requires a continuous stream of 
data with the oldest data sample being replaced by the newest sample after 
each azimuth output. 

2.0 CCD MEMORY CHARACTERISTICS 

The first element in the data path of Fig. 1 is the CCD memories, which 
store the samples until a synthetic aperture length of data has been collected. 
Three types of 65K CCD memories are presently available from different 
sources. The 65K CCD memories from Fairchild, F464, and Texas 
Instruments, TMS 3064, are organized as sixteen 4K shift registers which 
operate at a maximum speed of 5 MHz. The single- bit output is selected 
from one of the 16 shift registers by a 4-bit address. The Intel device, 

2462, is organized as 256 registers of 256 bits each, and the maximum clo'.k 
rate is 1 MHz. All three memories have a read- circulate, irite, and read- 
modify- write mode. W len performing the convolution operation, the read- 


V-7-2 




Fig. 1. Azimuth Convolution Processor 
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recirculate and write modes are used. The TMS 3064 and the F464 chips 

o 

require 12- V clock drivers, while T L clocks are usee* on the 2462. The 

time required to output all 65K bits from the CCD memories C T Q ) is 13. 1 * 

10" 3 s for the 3064 and the F464 when operating at a 5- MHz clock rate 

-3 

and 65. 5 x 10 s from the 2464 when operating at a 1-MHz clock rate. This 
is the minimum time to process all range samples for a particular azimuth 
position and provides a real-time azimuth resolution, W , of 

W A = 1. 25T 0 V a , (2) 

where V is the aircraft velocity, and 1. 25 azimuth samples per resolvable 

a 

element are processed to give 30- dB azimuth sidelcbes. 

Assuming an aircraft speed of 1000 it/s, the best azimuth resolution avail- 
able in real lime is 16. 4 ft from the F464 and TMS 3064 devices and 81. 8 ft 
from Intel's 2464. Better azimuth resolution may be obtained by recording 
the data on magnetic tape in veal time and using a lower speed for playback. 
Five- ft resolution may be c > t ained by reducing the* tape playback speed by a 
factor of 3. 3. This assumes enough complex ravutipliers are used so that 
the shift rate in the CCD memory is the limiting parameter. 

3.0 PROCESSOR MEMORY ORGANIZATION 

Many combinations of bit length, range samples, and azimuth samples for a 
CCD memory chip are available as shown in Table 1. To avoid complex 
timing or address record keeping, the product of the bits per data word and 
the range samples should equal some multiple of the shift register length. 
Because the chips are organized as single-bit input and output, several bits 
of the same word may be stored sequentially in each shift register. However, 
if parallel multipliers are used in the correlator, a serial- to- parallel con- 
verter must be used at each CCD output. Such an organization is desirable 
only if the number of range samples is less than the CCD shift register 
length, in which case part of the shift register would remain unused. The 
number of bits per data word, Ng, column 1 of Table 1, is determined by 
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TABLE 1 - OPTIMUM CHIP ORGANIZATIONS 
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the dynamic range requirements and will be considered fixed at 4, 6, or 8 
bits, although there is some interest in single-bit processors. The number 
of clock pulses per data word, Cp, determines how many bits of each data 
word are stored on a chip. The moct practical combinations are a single- 
clock pulse, Cp = 1, in which case a single bit is stored in each chip and 
N fi parallel chips are required; and C p = N R , in which case all bits of a 
data word are stored on a single chip and the data are output serially. The 
number of range samples, N R , stored for each azimuth sample (column 3 
of Table 1) is given by 




x m 



(3) 


v/here N gh is the number of bits pei shift register in the CCD, and m is 
the number of shift registers filled by a range line of data. The upper limit 
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of m is the number of shift registers in the CCD, 16 in the TMS 3064 and 
F464, and 256 in the Intel 2462. The number of range samples per PRF is 
given fay 


M KAS 

"r = w~ 

T 




where AS is the slant range swath and is the slant range resolution. 
The number of shift registers in a CCD required to store a range sample 
is given by combining eq. (3) and (4) and solving for m: 


m 


KaSC, 


W r N SH 


( 5 ) 


The number of azimuth samples stored in a CCD memory, column 4 of 
Table 1, is given by: 


N 


AZ 


N CCD 
iC • C, 


2 16 

n sh ' m 


( 6 ) 


R P 

where is the number of bits in a CCD (64K). The number of azimuth 

samples across the synthetic aperture, required to process the data 

to an azimuth resolution, W , is given by. 


N 


LS 


T K K^R\ 
V W a = 2W. 2 


0) 


The number of chips required to store a fraction C p /Ng of the data is 
determined by dividing eq. (7) by eq. (6): 





N R C p 


N 


CCD 


( 8 ) 


The total number of memory chips, N^, is then obtained by multiplying 
eq. (8) by N B /C p to give: 
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( 9 ) 



n r c p n b n ls n r n b 

- • - — - - - 

n ccd c p n ccd 


The number of chips required varies inversely as the range resolution, 
inversely as the square of the azimuth resolution, and directly as the range 
and range swath. 

4.0 PROCESSOR DESIGN 

As an example of azimuth processor design, a slant range swath of 10 mi is 
imaged at a maximum range of 30 mi with azimuth and range resolution of 
10 ft with I and Q data channels of 4 bits each. The displacement caused by 
range curvature may be determined from the equation: 

D^L , (10) 


which for an X-band radar, X = 0. 1 ft, is 1. 75 ft. This is sufficiently 
small so that range curvature correction is unnecessary. The value of m 
is, from eq. (5) when using = 4090, and C p = 1, 

KASCp 1.25 x 10 NMI x 6000 ft/NMf x 1 , 00 

m " N . W 4096 x 10 ft 1,83 

sh r 


Because m must be an integer, the next highest value, m - 2, is chosen. 
This increases the slant range swath from 10 NMI to 10. 9 NMI. The num- 
ber of range samples is, from eq. (3), 



N, 


SH 


m 


= 8192 . 


The number of azimuth samples stored per chip is, from eq. (6), 

XT N CCD 65 536 _ o 
AZ " N R . C p " 81§2 " 
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The number of azimuth samples across the synthetic aperture, is 
determined from eq. (7) to be: 


Aa (1. 2S) 2 X 30 NMI X 6000 ft/frMI x 0.1 ft 
2W a J 2 x (10 ft)* ~ 


140.6 




Because 8 azimuth samples are stored per chip, the number of azimuth 
chips required is 140/8 = 17. 5, so 18 azimuth chips are used, increasing 
the number of azimuth samples, N T <y to 144. The total number of CCD 
chips required for azimuth processing may be determined from eq. (9) to be: 



Nib-Nr-Nb 


N, 


CCD 


144 x 8192 x 8 

85H5S5 


= 144 


The power required is approximately 0. 25 W/chift or 36 W total, for the 
memory and 19 W for the 42 clock drivers. The memory and clock drivers 
require three printed- circuit boards. 

5.0 MULTIPLIER ACCUMULATOR 

The data are output from the memory to the multipliers (see Fig. 1). The 
product of the complex signal and the complex reference is formed and 
summed for all azimuth data point .. The four products which are formed 
to give the in-phase and quadrature components are: 

In-phase = S^R^ - SqRq 

Quadrature = S.Rq + SqRj 

The choices of integrated circuits to perform the complex multiplications are 
an 8-bit multiplier from Monolithic Memories and an 8-bit multiplier- 
accumulator from TRW. The multiplier chip requires an output latch as 
well as an adder stage and an accumulator to equal the capability of the 8- bit 
TRW multiplier accumulator. The combined space and power requirements 
of the multiplier, latch adder, and, accumulator combination are greater by 
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a factor cu 2 than the space and power required by the TRW chip, which uses 
1. 2 W to accomplish a multiply- accumulate operation in 70 ns. Because the 
multiplier accumulator operates at greater than a 10- MHz rate, a single 
chip can accumulate the products of either the in-phase or quadrature com- 
ponents of eq. (11) by forming the first product and then adding or sub- 
tracting the second. The multiplier accumulator output is a 19-bit sum of 
products which may accommodate sixteen 15-bit products without overflow. 
Fifteen-bit products result from the multiplication of two 8-bit words. 

Sixty- four products of 11 bits each may be accumulated without overflow 
when there are 6 bits in the signal and the reference, and 256 products of 
7 bits each when there are 4 bits in the signal and the reference. If the data 
rate to the compression filter is slowed sufficiently so that 256 products of 
7 bits each may be stored in the accumulator, only a minimum number of 
accumulators are required to do the convolution indicated in eq. (1). 

Assuming a 5- MHz shift rate in the CCD’s, a single accumulator may store 
the in-phase or quadrature products for the eight azimuth samples stored in 
a CCD memory. The required number of multiplier accumulators is then 
144/8 x 2 = 36. The required power is 1.2 W/chip x 36 chips = 43 W. 

An additional 8 W is required to multiplex the CCD outputs to form the two 
products indicated in eq. (11) for the in-phase and quadrature channels. 

The final operation in performing the convolution operation is to add the out- 
puts the multiplier accumulators (18 I and 18 Q accumulators). The num- 
ber v its output from each accumulator in the example used is 11 (16 
products of 7 bits each). The power required to accumulate the final 
products is 20 W, and the number of chips required to sum the accumulator 
outputs is 260, consisting of latches and adders. The output is a 16-bit sum 
of products in I and Q occurring each 1.6 n s. The power and chip require- 
ments for each function are listed in Table 2. The reference function 
generator is not included in the table, but a reasonable estimate would be 
two cards and a total power consumption of 10 W. 
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TABLE 2 - SUMMARY OF PROCESSOR REQUIREMENTS 
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6.0 CONCLUSIONS 


Modern CCD n. jmories and signal processing chips provide the capability 
to design and build radar signal processors using a fraction of the volume 
and power required by present processors. As a comparison with current 
technology as exemplified by the SAPPHIRE processor, the size of an azi- 
muth processor employing CCD memories and signal processing chips 
would occupy seven percent of the volume and use five percent of the power. 
It would have applications where near -real -time processing was satisfactory 
and where low power, size, and volume were desirable. 

Other desirable attributes include a reduction in the number of individual 
printed circuit card types and the number of interconnections which enhance 
the reliability and reduce the cost. The processor is flexible because range 
and azimuth tradeoffs may be performed within each chip as given in Table 1 . 
The processor is also modular with the basic module consisting of a single 
processor card which includes both the memory and the multiplier accumu- 
lator. As a result of fewer parts and simpler design, the life cycle costs 
of operation and maintenance are also reduced. 
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